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SIGNAL  DETECTION  FOR  SPHERICALLY  EXCHANGEABLE  (SE) 
STOCHASTIC  PROCESSES 


C.  B.  Bell 

San  Diego  State  University 


1.  Introduction  and  Summary 

Consider  signal  detection  in  the  following  context.  The  background 
pure  noise  (PN)  is  produced  by  nature's  choosing  a  parameter  value  and 
then  generating  a  time  series  according  to  a  (stochastic  process)  law 
with  that  parameter  value.  This  process  may  be  repeated  N  times.  If 
there  is  signal  present,  what  is  generated  is  a  (N  +  S) ,  noise-plus- 
signal, time  series.  The  statistical  properties  of  the  latter-type  of 
time  series  are  different  from  those  of  the  former. 

In  the  sequel,  one  will  be  concerned  primarily  with  the  case  in 
which  the  parameter  chosen  is  a  variance  and  the  time  series  subsequently 
generated  is  an  i.i.d.  zero-mean  Gaussian  time  series  with  that 
variance.  The  variances  are  chosen  by  nature  according  to  a  positive 
cdf,  corresponding  to  which  will  be  a  radial  distribution.  [See 


Appendix  II] . 

The  signal  detection  procedures  developed  below  will  be  valid  not 
only  for  the  (conditional)  Gaussian  model,  ft(S-S-E) ,  described  above. 
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but  also  for  a  wider  family  fi(K-S-E)  of  pure  noise  (PN)  stochastic 
processes.  This  is  because  for  all  S-E  (spherically  exchangeable) 
distributions,  the  set  of  observed  radii  {R^}  are  sufficient,  and 
the  set  of  "angles"  are  independent  of  the  radii  and  have  identical 
uniform  distributions.  [See  Section  2.C.  and  Appendix  II]. 

Five  different  signal  detection  models  will  be  explicitly  treated. 

Hie  treatment  of  these  models  will  indicate  how  a  variety  of  related 
signal  detection  problems  can  be  treated. 

The  paper  is  divided  into  ten  sections.  In  Section  2,  the  basic 
S-E  data  models  are  given.  Section  3  presents  the  five  detection 
problems  to  be  considered.  Statistical  preliminaries  are  presented 
for  historical  data  and  for  cross-sectional  data  in  Section  4.  Detection 
procedures  for  the  five  problems  are  developed  in  Section  5  through  9. 

Open  problems  and  concluding  remarks  are  given  in  Section  10. 

2.  Data  Structure  of  the  Background  Noise 

It  is  assumed  throughout  that  decisions  are  made  on  the  basis  of  a 
finite  number  of  observations.  Hie  actual  time  series  being  observed 
may  be  infinite,  but  one  only  u;*es  a  finite  initial  segment  in  the  case 
of  historical  data;  and  a  finite  number  of  finite  initial  segments  for 
cross-sectional  data.  In  each  case,  the  law,  L,  of  the  process 
determines  the  distribution  of  the  initial  segment,  and  vice  versa.  The  two  words 
are  sometimes  used  interchangeably. 

The  data  to  be  considered  is  of  the  form  Z, ,  Z-,  ..,  Z„,  where 
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hr  =  (Zrl’ '•••»  2rk}’  and  the  probabilistic  structures  of  interest  are 
given  below. 

(2A)  Normal  Spherical  Exchangeability  fl(N-S-E) 

2  ^ 

Zj,  ...»  Z^  are  i.i.d.  N(0,  a  i^),  i.e  ,  the  k  components 

2 

of  each  Z-vector  are  i.i.d.  W(0,o  ).  This  is  the  original  spherical 
structure  of  interest.  The  family  of  laws  of  all  such  time  series  is 
fl(N-S-E) . 

(2B)  Schoenberg  (1938)  Spherical  Exchangeability  fl(S-S-E). 

Wj,  . . . ,  WN  are  i.*.d.  H(-).  with  H(0)  =  0,  and,  conditionally 

given  =  wr»  *rl,  Xrk  are  i.i.d.  M(0,  l/wy). 


Definition  2.1.  H(*)  is  called  the  mixing  measure. 

Here,  one  can  say  that  each  ,  has  a  multivariate  distribution 

which  is  a  variance-mixture  of  zero-mean  normal  random  samples.  If 
H(*)  is  a  one-point  (i.e  ,  degenerate)  cdf  then  one  has  case  (2A) 
above.  [Some  relations  between  ft(S-S-E)  and  Gaussian  Markov  processes 
is  given  in  Appendix  II.].  The  family  of  laws  of  all  such  time  series 
is  O(S-S-E) . 

(2C)  Kelker  Spherical  Exchangeability,  fl(K-S-E) . 

b. *  &'•••'&  are  iUd-  where  W&  with 

(1)  R,  and  V.  being  independent;  (2)  R.  J(-),  where  J(0)  *  0; 
j  vi  3 

and  (3)  V.  •  (V.,,  ...»  V..)  being  uniformly  distributed  over  the 
*Xr  J  J 1  k  ’ K  k  Z 

hypersphere  S*  ■  {v:  I  vj  *  w^th  Rj  *  I  ZjS  ®nd  VjS  * 
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Definition  2.2.  J(*)  is  called  the  radial  cdf.  Some  examples 

of  radial  distributions  are  given  in  Aopendix  II. 

It  is  immediate  that 

Theorem  2.1.  fl(N-S-E)  C  fl(S-S-E)  C  Q(K-S-E) . 

Schoenberg  (1938)  essentially  proved  that 

Theorem  2.2.  For  infinite  time  series , 

fi(S-S-E)  =  fi(K-S-E). 

From  Lord  (1954);  Kelker  (1970);  Bell,  Avadhani  and  Woodroofe  (1970); 
and  Bell  (1975,  1978),  one  has 

Theorem  2.3.  Let  Z  =  (Z.,  .  . ,  Z.)  have  a  law  L  in  fl(K-S-E) . 

Then  for  1  <  m  <  k-1,  [/  Z2]  [  £  *  F(m,  k  -  m). 

"  “  1  j  m+1  j  m 

[This  is  Fisher's  F-distribution,  with  m  and  (k  -  m)  degrees 

of  freedom] . 

The  salient  point  here  is  that  there  are  non-Gaussian  laws  for  which 
the  F-distribution  is  valid. 

These  types  of  PN  structures  can  be  best  illustrated  by  the  following 
examples . 

Example  2.1.  fl(N-S-E) .  Let  Zrl  ...,  Z^  be  i.i.d.  W(0,5.4); 

Zy  *  (Zrj,  •••»  Zy,g) j  and  Zj,  ...,  Zjq  be  i.i.d.  The  Z^  here  are 

initial  segments  of  zero-mean  i.i  d.  Gaussian  time  series. 
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Example  2.2.  fi(S-S-E).  Let  Wj ,  . ..,  Wjq  be  i.i.d.  Ext,  (2.5). 
Further,  given  wr  =  wr»  let  zri»  *  *  zr8  be  conditionally  i.i.d. 

N( 0,  l/wr) .  Then,  the  unconditional  density  of  ZT  is 


f (z j ,  . . 


•  9 


exp  {-  y  I  z?)  (2.5)  exn  {-2.5w}dw 


(120) n~  [5.0  ♦ 


8,-5 

l  zh 

1  1 


and 


lv 


220 


are 


'V 

i.i.d.  F(-,  ...,  •)•  [See  Table  II. 1 

2  ?  2 

In  this  example,  each  R,  =  /  Z 

J  J  J5 


of  Appendix  II.]. 
and  the  radial  distribution 


J(-)  satisfies 

J(y)  *  P(R.  £y)  =  f  F*(yw2)  (2.5)  exp  {-2.5w}  dw, 

3  Jn 


where  F*(‘)  is  the  Xg-cdf. 


[Note:  If  H(-)  were  such  that  P„({5.2»  *  1,  then  f(z)  would  be 

g  n  <\» 

(10.4tt)_4  exp  {-  -j—  l  z2}  and  the  law  would  be  in  P(N-S-F).] 


Example  2.3.  fl(K-S-E) . 

Let  Z„  =  (Z„1#  ...»  Zrg)  and  Zj,  • • . ,  ^j5  b®  i.i.d. 


%r  v  rl 
with  density  of  the  form 


V*- 


,5,-2  5  , 

(  9  (|  zj)  ,  0  <  Z  zj  <  6^ 


fl(Zi,...,z5)  =  \ 


0,  otherwise 


•). 
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Then,  the  radial  distribution,  Jjf-}  is  l/( 0,0),  i.e., 

0  <  y  <6.  [See  Table  II. 1  of  Appendix  II.] 

If  the  common  density  for  the  Z's  is  of  the  form 


w 


ZS)  = 


k* 

*2 


5  .  -2 

ti 

1  3 


exp 


J{Cy) 


then  the  radial  distribution  satisfies 


J2(y)  =  1  -  exp  (-Xy),  y  >  0. 


[See  Table  II.  1.]. 

Statistical  detection  procedures  for  these  types  of  pure-noise  data 

involve  the  observed  radii,  R, ,  ...,  R„  which  function  as  sufficient 

1  N 

statistics;  and  the  "angles"  Vj,  . . . ,  VN  which  are  independent  of  the 

radii.  [The  V's  are  not  direction-angle  vectors  as  in  polar  coordinates. 
% 

The  latter  are  discussed  in  Appendix  II.] 

For  detection  problems  involving  specific  radial  distributions  one 
uses  the  {RJ,  while  for  detection  problems  involving  the  underlying 
spherical  structure,  the  {V.}  are  employed. 

The  signal  detection  models  to  be  treated  in  the  sequel  are  given 
in  the  next  section. 


3.  Some  Signal  Detection  Problems 

In  each  of  the  cases  below,  the  PN  (i.e.,  pure-noise)  distribution 
or  law,  L,  is  described.  Data  reflecting  a  different  stochastic 
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process  law  would  indicate  that  a  signal  is  being  received  (in  addition 
to  the  pure  noise) . 

3.1.  Detection  Problems  with  Historical  Data 

For  these  problems,  one  assumes  that  the  total  data  available  is 
Z  =  {Z  :  1  £  r  <  k).  (In  terms  of  the  preceding  section,  one  has  N  =  1) . 

The  problems  are  as  follows. 

PN. :  L((z))  £  fi(K-S-El 

i  ^ 

This  means  the  background  pure  noise  is  itself  S-E. 

PN_:  L({ Z  -  a})  e  fi(K-S-F)  for  some  (unknown)  a. 

For  this  case,  one  hypothesizes  that  the  pure  noise  is  S-E  about 
some  unknown  point  a,  possibly  not  zero.  If  "a"  is  known,  subtraction 
leads  to  PN^  above. 

Certain  other  detection  problems  can  only  be  handled  when  the  data 
is  cross-sectional,  i.e.,  the  data  is  of  the  ^orm  below. 

*  (h\ 

Z  -  :  where  Z,  =  (Z,, . ZA) 

\  <vN  • 

3.2.  Detection  Problems  with  Cross-Sectional  Data. 

PN3:  L( Z)  =  L0  e  fi(K-S-E) 

Here,  one  gives  a  specific  pure-noise  law.  Any  other  stochastic 
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behavior  of  the  data  indicates  signal  is  present.  Z. ,  ....  Z  are 

'V.l 

i.i.d.  and  the  associated  i.i.d.  radii  R, ,  ....  R  are  used  in 

l  n 

the  decision  procedures. 


PN.:  L({Z  -  a})  =  L.  e  fi(K-S-E)  for  some  a. 
4  %  U 


This  situation  refers  to  a  family  of  PN-laws  indexed  by  "a”, 
a  nuisance  parameter.  The  decision  procedures  will  involve  radii  modified 
by  "estimates"  of  "a". 

The  final  detection  model  to  be  considered  is  a  two-sample  model. 

The  observer  observes  two  independent  time  series  (X^)  and  {Y^}, 
and  the  pure-noise  situation  entails  their  stochastic  laws  being  equal. 
[See  Bell  (1964),  Model  II.] 


The  data  here  is 


where 


z  = 

(  <xn’ 

Xrk5’ 

for  1  <  r  £  m 

%r 

1  ^Yr! * 

....  Yrk) 

for  m  +  1  _<  r  <  N 

PNS:  LC(Xr»  =  L({Yr)) 


The  mechanisms  for  handling  these  five  detection  problems  (and  some  related 
problems)  are  given  in  the  section  below. 


-  9  - 


4.  Statistical  Preliminaries:  Sufficient  Statistics  and  Statistical 

Noise. 

For  developing  the  methodology  it  is  easier  to  consider  first  Jl(S-S-E) , 
since  the  family  consists  of  laws  involving  conditional  Gaussian  dis¬ 
tributions.  Nuisance  parameters,  and  M-S-S's  (minimal  si  cient 
statistics)  can  be  handled  in  traditional  ways.  Then  sii-.«.c  ..  "angles" 

(V. }  have  the  same  behavior  for  Q(N-S-E) ,  fi(S-S-E)  am  'K-R-E), 
the  decision  procedures  developed  can  be  extended  from  H(;.  .>-E)  to 
Jl(K-S-E) . 

One  recalls  that  the  above-mentioned  extension  is  not  necessary  if 
the  data  vectors  are  initial  segments  of  infinite  S-E  time  series.  For 
such  cases  (See  Theorem  2.2)  fi(S-S-E)  *  fi(K-S-E). 

To  these  ends,  one  now  develops  the  mechanisms  with  f2(S-S-E)  in  mind. 

(4A)  Basic  Statistics 
% 

Let  Z  be  a  data  matrix  with  law  L'  in  a  family  O’  admitting 

a  M-S-S  (minimal  sufficient  statistic)  S(Z). 

Definition  4.1  Let  6(Z)  -  (S(Z),  W(Z) ]  where  (1)  6(-)  is 

1-1  a.e.;  (2)  S(Z)  and  N(Z)  are  independent.  Then  (a)  6(-)  is 
called  the  BDT  (basic  data  transformation)  for  £)';  and  (b)  V(Z) 
is  called  the  M-S-N  (maximal  statistical  noise)  for  O' . 

(One  should  note  that  for  any  given  family  O',  S(Z),  W(Z)  and 
$(•)  need  not  be  unique.]. 

These  entities  are  used  in  constructing  families  of  detection  statistic' 


with  certain  desirable  properties. 
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(4B)  Distribution-Free  Statistics 

n, 

Definition  4.2.  (1)  A  statistic  T(Z)  is  NPDF  rt  ,  if 

'V 

there  exists  a  cdf  Q(-)  such  that  P{T(Z)  £y|i}  =  Q(v)  for  all 
y  and  for  all  L  in  fl' . 

(2)  A  family  of  statistics  {T*(Z ,i):  Lett'}  is  PDF  wrt  ft' , 
if  there  exists  a  cdf  Q*(-)  such  that 
n, 

P{T*(Z ,L)  <_  y  J  L }  =  Q*(y)  for  all  y  and  for  all 

L  in  fi' . 

A  rule  of  thumb  for  the  sequel  is  as  follows. 

4. A.  For  detection  problems  in  which  the  PN-distribution  involves 

a,  n, 

a  specific  law,  Lq  employ  statistics  of  the  form  T*fZ,L0)  =  iJj*[S(Z)  ,Lp] , 
This  entails  using  the  data  only  via  the  M-S-S.  Such  statistics  are 
PDF  wrt  Q’ . 

4.B.  For  detection  problems  which  are  concerned  with  the  general 

%  r\, 

structure  of  the  PN-Family,  employ  statistics  of  the  form  T(Z)  =  <KW(Z)). 
This  entails  using  the  data  only  through  the  M-S-N.  Such  statistics 
are  NPDF  wrt  f?' . 

F.xample  4.1.  Let  L,,  e  ft(S-S-E)  and  Z  =  (Z, ,  Z_) .  This  means 

- - -  n 

nature  has  chosen  W  =  w,  where  W  a,  H(-)>  and  given  W  =  w,  Z  ,  ...  Z 

7  17 

are  conditionally  i.i.d.  N(0,  — ) .  R  =  (£  z?)*^  is  the  M-S-S 

W  1  3 

for  H(-)»  which  determines  and  is  determined  by  L  (See  Bell,  et  al,  1970). 


v  -  (-J  ,  . . . .  is  M-S-N;  and 

6(Z)  =  (R,V),  where  S(Z)  =  R  and  N(Z)  =  V.  Now,  let 

a.  n.  %  % 

VP  *  V1  •  -£  and  VP  V  ■  V»- 

where  J^(y)  =  P(R_<  y|H)  =  P{ R  _<  y 1 L^},  i.e,,  is  the  radial  cdf. 

Then,  T, (•)  is  NPDF  wrt  ft(S-S-E)  with  cdf  O(z)  =  i  +  —  arc  sin  z 
I  i  TT 

M<5  .  T2(-,0  is  PDF  wrt  fi(S-S-E) . 

Now  for  fi(K-S-E),  R  is  the  M-S-S  for  the  radial  distribution, 

and  V  is  M-S-N.  Therefore,  T,  is  NPDF  wrt  P(K-S-F) ,  and 
%  i 

T2  is  PDF  wrt  fi(K-S-E). 

[Note:  For  analyses  somewhat  different  from  that  of  the  sequel,  one 
needs  a  polar  coordinate  model,  angular  distributions,  and  the  relations 
between  joint  densities,  radial  distributions  and  characteristic 
functions.  These  are  given  in  Lord  (1954);  Kelker  (1969);  Smith  (1971); 
Bell  and  Smith  (1970),  and  Ahmad  (1975),  and  in  Anpendix  II.]. 

(4C)  Extraneous  Statistical  Noise  (E-S-N) 

One  further  statistical  tool  involves  the  use  of  E-S-N.  The 
essentials  of  this  method  have  been  employed  by  several  authors,  e.g  , 
Durbin  (1961);  and  Bell  and  Doksum  (1965). 

Definition  4.3.  Let  Z  =  (Z. ,  Z.)  be  data  governed  by  law  L 

. . .  'Yj  1  K 

in  Q •  with  M-S-S,  S(Z);  M-S-N  W(Z) ;  and  BDT,  6(0 .  Let 

i\> 


1 2 


Y  =  (Y. ,  ....  Y.)  be  independent  of  Z  and  be  governed  bv  L,  in 

'V  A  K  '  1 

SI'.  Define  Y’  to  be  [Y! ,  ....  Y»]  =  6_1[S(Y),  W(Z)  ] . 

'V  IK  r\j 

(1)  Y  is  called  E-S-N  (extaneous  statistical  noise);  and 

(2)  Y*  is  called  R-S-N  (randomized  statistical  noise). 

Paralleling  the  proofs  of  the  aforementioned  authors,  one  can  prove 

d 

Theorem  4.1.  (Randomized  Noise  Theorem).  Y*  =  Y. 

■  ■  %  % 

This  result  illustrates  a  method  of  imposing  a  known  fusuallv  tractable), 
distribution  on  a  problem  in  which  the  cdf  of  the  data  is  unknown. 

It  is  particularly  useful  when  the  distribution  of  the  M-S-N,  N(Z) , 
is  relatively  intractable. 


Example  4.2.  Let  Z  =  (Z^,  ...,  Z 2q)  be  i.i.d.  W(0,u*)  where 

u*  >  0  is  unknown.  Let  Y  =  (Y. ,  ..  ,  Y„n)  be  independent  of  Z 

%  I  Zl/  % 

and  i.i.d.  N(0,1).  Then,  the  BDT  is  such  that  6(Z)  =  [S(Z),  W(Z)], 
I  12  2  %  Z.  *  ^ 

where  S(Z)  =  R  =./  7  Z.  ,  and  N(Z)  =  V,  with  V.  =  -A-  .  v  is 
%  V  ,  I  %  %  1  P  'v 


uniformly  distributed  over  the  hypersphere  S*(l) 


6  1[S(Y),  W(Z)]  =  (Z.: 

%  20  \  R  1 
>2  =  l  Y.  .  Then, 

1  J 


Z20>  ■  <Yi’ 


(R*r 


Y* 

rr 


Y' 

20’ 


are 


[See  Section  2.C], 
where 
i.i.d.  N(0  1). 


Y'  )  =  Y' 

20-*  l 


(4D)  Nuisance  Parameters  and  Kolmogorov-Smirnov  Statistics 
The  next  set  of  statistical  tools  of  this  section  involve  modified 
K-S  (Kolmocrorov-Smiraov)  statistics.  The  original  statistic  of  this 
class  was  (Kolmogorov  (1933)) 
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d(Fq!  n)  -  sup  iFn(z)  -  Fq(z)|,  where 
z 

1  n 

rn(z)  =  n  I  G  f-z  -  Z,.},  where  Z^,  . .  ,  z n  are  i.i.d. 

F0(-),  continuous;  and  e(u)  =  1,  if  u^O,  0,  if  u  <  0. 

For  several  of  the  problems  of  the  sequel,  FQ(-),  ^s  not  known 
completely .  It  is  known  to  be  a  member  of  a  family  of  cdfs ;  and 
hence,  there  is  a  nuisance  parameter  involved.  That  is,  F0(-)  e  £1"  = 
(F(-;0):  0  e  ©}. 

For  such  situations,  one  uses  extensions  of  the  ideas  of  Lilliefors 
(l'>67,  1969),  Srinivasnn  (1971)  and  Choi  (1980). 

Definition  4.3.  Let  Z=(Z. ,  ....  Z  J  be  i.i.d.  Fn  e  Q"  « 

'vi  n  0 

(F(-;6):  0e  ©},  where  Q"  admits  a  M-S-S,  S(Z) ,  and  a  MLE, 

A  % 

0,  of  0.  Let  Fn(y)  =  F(y;0)  for  all  y;  and  F  (y)  =  E(F  (y)|S(Z)), 
for  all  v. 

A  A 

(1)  Dn  =  sup  ( Fn(y)  -  Fn(y)|,  is  the  Lilliefors -type  statistic 
for  £)";  and 

(2)  r»n  =  sun  |Fn(v)  -  F  (y)  I  is  the  Srinivasan-type  statistic  for 

y 

fi". 


Example  4.3.  Let  Z  =  (Zj,  ....  ZJ5)  be  i.i.d.  W(0,6).  Then, 
the  M-S-S,  S(Z)  =  R,  and  the  M-S-N,  N(Z)  =  V,  where 

15  z  a, 

r2  ‘  £  zr  vi  *ir  •  l  -  <vi . viS)-  ">«>.  f15w  ■ 

5.  J,! 

°  15  R  ■ 


since 
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Further  F1S (y )  =  E{Fn(y)|f1}  =  P{Z^  _<  y  |R }  =  i  +  ^tan'1^)  for 

-R  <  y  <  R.  The  Lilliefors-type  statistic  is,  then. 


statistic  is 


a  -  y  ■J\^> 

«  sup|F^(v)  -  $0 — j^-)  !  ,  and  the  Srinivasan-^vDe 


*15  *_R^R  lF15(y:)  '  [I+  ?tan'1(^l 


Tables  for  both  of  these  statistics  are  given  in  Apnendix  I. 


Example  4.4.  Let  2=  (2,,  ....  Z„)  be  i  i.d.  N(9) ,  where 

- 1 -  %  1  9  g 

e  =  (p,  a2).  Then  S(Z)  =  (Z,  Sz)  ,  where  S2  =  ~  f  (Z^,  -  Z)2;  and 


p, 


z  -  z 

-  (-V- 


Z9  -  Z 

- )  =  (Wj ,  ....  ffg).  Here  Pg  - 


up  1 FQ  Cy )  -  ■  -I— •  )|.  This  is  the  statistic  studied  by  f.ill^efors  (1967). 

y  b_ 

y  z 


% 


D  =  sup  |  F_(y)  -  F  (y)|,  where  F  (•), 


is  given  by  Srinivisan  (1971),  and  is  given  explicitly  in  Example  6.2 
of  Appendix  I. 

A-  A, 

Wh nt  is  of  importance  here  is  that  each  of  the  statistics  D 

A  p, 

Dg  and  Dg  (of  the  preceding  two  examples)  is  a  function  of  the  data 
only  through  the  relevant  M-S-N.  In  fact,  one  ran  easily  derive 
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(a)  6.,  =  sup  1^  1  e  (x  -  V  }  -  *(X  /IS)  | 

x  1 1  3 


(b) 


D ,c  =  sup  U  l  e  (x  -  V  )  -  [j  ♦  i  tan'1x]|; 
1  -1<x<1  1  J 


15  - 


and 

9 

(c)  Dg  =  sup  |i  |  e  (x  -  Vj)  -  $(x) | 

% 

[The  derivation  for  Dg  is  not  so  straightforward.] 

The  final  statistical  tool  of  this  section  involves  the  use  of 
Helmert  matrices. 


(4E)  Helmert  Matrices,  M-S-S's  and  M-S-N. 

Definition  4.4.  An  (N  x  N)  square  matrix  =  {h.^}  is  called  a 
Helmert  matrix  of  order  N  if 


h.  .  = 
ij 


f  -A 

ft 


for 


1 


< 


V 


SHUT) 

rli-.U 

/i(i-l) 

0 


% 

It  is  clear  that  each  H^ 


i  =  N 

for  1  1  i  1  N  -  1 ,  1  <  j  £  i  - 

for  l<i£N-l,  j  =  i 

for  1  £  i  £  N  -  1,  i  +  1  £  j  £  N 
is  orthogonal,  and  that 


1 


Theorem  4.1.  If  Z  *  (Zj ,  . . . ,  ZN)  is  governed  by  a  law  L  in 
n(K-S-E)  and  X  =  Z  hL  then  X  »  Z. 

^  Ti  >\j  \ 

The  Helmert  matrix  can  in  some  circmstances  be  used  to  construct 
the  M-S-S  and  M-S-N. 
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Example  4.5.  (Gaussian  Random  Sample).  Let  Z  =  (Z .  Z_)  be 

%  1  5 

.  2  Vp 

H.  .  where 


»5  = 


d  let 

Y  = 

<V 

•  •  > 

V  =; 

1 

/ 2 

1 

/T 

0 

0 

0 

1 

1 

-2 

0 

n 

JE 

/6 

/6 

u 

l 

1 

1 

-3 

o 

/TT 

/lT 

/l2 

/rr 

1 

l 

1 

i 

-4 

.To 

✓To 

/To 

»To 

/To 

1 

l 

1 

l 

l 

/T 

/S 

/5 

vT 

2 

Then,  Y^,  Y4  and  Y,.  are  independent,  with  Y^  ^  W(0,  a  )  for 

1  _<  j  _<  4,  and  Y  N(y/T,  o ).  One  has,  then,  that  the  M-S-S  for  (y.cr*') 

2  4 

is  S  (Z)  =  (Y,.,  R*) ,  where  (R*)  =  £  y.  and  the  M-S-N  is 

*  Y  i  J 

W(Z)  =  CVj,  ....  V4)  where  V.  =  . 

One  can  now  treat  the  ^irst  detection  oroblem. 


5 .  S-E  Background  Noise. 

PN:  L  c  ft(K-S-E)  vs  N  +  Sj  J .  £  n(K-S-E) 

As  previously  mentioned,  the  technique  will  be  to  develop  the  methodology 

for  ft(S-S-E),  and  extend  its  validity  to  P(K-S-E). 

For  fi(S-S-E),  one  historical  "look"  yields  Z  *  (Z.,  ZM)  which 

-v  1  N 


I 


I 

I 

i 
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are  i.i  d.  N(0,  (w*)  ,  given  W  =  w*,  where  W  H(0»  with 

H(0)  =  0. 

Then,  one  has 


Theorem  5.1.  Conditionally  given  W  =  w*, 


(a) 

the 

M-S-S  is  S(Z) 

=  R; 

z. 

z. 

(b) 

' 

the 

M-S-N  is  N(Z) 

=  v  =  r— 

-v,  lR  * 

R  }‘ 

Cc) 

the 

BDT  is  fi(Z)  = 

(R,V);  and 

'V. 

(d) 

the 

MLE  of  w*  is 

—  RZ 

N  R 

From  these  entities,  one  derives  the  decision  rule  based  on  the 
relevant  statistics  of  Section  4. 

Decision  Rule  5.1:  Decide  N  +  Sj  iff 

Dn  =  sup  J  Fn (y)  -  $(£—•))  >  d(a,N)  . 

[Critical  values  d(a,N),  are  found  in  Table  A.l  of  Appendix  I.] 

Decision  Rule  5.2.  Decide  N  +  Sj  iff 

dn  =  sup  |FN(y)  -  [\  *  J  tan_1(^)]|>  d(a,N). 

y 

n, 

[Values  of  d(a,N)  are  given  in  Table  A. 2  of  Appendix  I.] 

Decision  Rule  5.3.  Decide  N  ♦  Sj  iff 
m  ,  N  ,  -1  M 

T  =  [l  Zt]  [  l  ZZ]  <  f'  or  >  f". 

1  3  m+1  J  a 


l 


18  - 


[The  f*-  and  f'-value*?  are  found  in  a  Fisher's  F-tahle  for  m 
and  (N  -  m)  degrees  of  freedom]. 


Decision  Rule  5.4.  Decide  N  +  if 


DN  =  SUP  I  N 
z 


N 

y  £  (z  -  Y!)  -  <J>Cz)|  >  d(a,N),  where  Y  =  (Y. , 
i  J  %  1 


is  E-S--N  and  Y'  =  (Y, ,  ....  YJ  is  R-S-N  as  in  Section  4 C.  [The 
%  1  N 

d(a,N) -value  are  found  in  a  standard  one-sample  K-S  table]. 

[Note:  If  the  data  available  is  cross-sectional  data,  one  might  make  the 

adaptations  in  decision  rules  suggested  at  the  end  of  Section  6], 

All  of  the  decision  rules  are  illustrated  in  Appendix  I.  Each 

example  bears  the  same  number  as  the  decision  rule  which  it  illustrates, 

e.g.,  Example  5.4,  illustrates  Decision  Rule  5.4. 

It  should  be  noted  that  each  of  the  decision  rules  of  this  section 

involves  the  data  solely  via  the  M-S-N,  V,  and  hence  is  NPDF.  Also 

% 

V  is  the  M-S-N  for  fl(S-S-E)  as  well  as  for  O(K-S-E) .  Hence,  one 
n, 

has 


A  V 

Theorem  5  2.  (1)  Each  of  the  statistics  D^,  D^,  T,  can  be 

*  % 

written  in  the  form  ty[V],  i.e.  D„  =  [V] ;  D„  =  i|>,[V], 

*\/  JV  1  'X/  11  4. 

T  =  tp_[V]  and  D'  =  <[>.  [V] . 
on,  n  4  ^ 

(2)  Hence,  each  of  these  statistics  is  NPDF  wrt  fi(K-S-E) . 

The  next  detection  p-^oblem  involves  two  nuisance  parameters  from  the 


point  of  view  of  0(S-S  E) . 
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6 .  Background  Noise  S-E  about  an  unknown  Point,  a. 

P^:  L((Z  '  £  ^(K-S-E)  for  some  point  a  vs 

N  +  S  L({z  -  a})  £  fi(K-S-E)  for  any  noint  a. 

L.  *\j 


From  the  Q(S-S-E)  point  of  view,  the  historical  data  is  Z  =  (Z 

1 

which  are  c.i.i.d.  N(a,  (w*)"*),  given  W  =  w*. 


Theorem  6 . 1 

(a)  M-S-S: 

(b)  M-S-N; 


The  basic  statistics  (conditionally)  are  then 
S(Z)  =  (X,S  ); 

r\,  A  — 

«(X)  V  -u.  U,  .  JJ-1 


(c)  BDT:  «(Z)  -  (X,  SY.  U);  and 

A  ^ 

(d)  MLE  of  (a,  w*) :  (X,  S^)  . 


Some  important  decision  rules  are  then  as  follows: 


Decision  Rule  6.1.  Decide  N  ♦  Sj  iff 

Dn  =  sup  |FN(y)  -  >  S(«,N). 

y  x 

[The  Lilliefors  (1967)  table  yields  critical  values]. 


Decision  Rule  6.2.  Decide  N  ♦  Sj  iff 

ty  %  % 

Dn  =  sup  |FN(y)  -  FN(y)l  >  d(a,n). 

y 

[Both  Fn(-)  and  the  critical  values  are  given  by  Srini^asan  (1970). 
See  Example  6.2  of  Appendix  I.] 


Decision  Rule  6.3.  Decide  N  +  iff 
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dn  =  sup  I  iSr  1 e  (y '  >  cJfct>n) 

y  l  J 

x.  -  x 

where  Y'.  =  Y  +  S  (-L - .) ,  and  Y  ,  ....  Y..  is  E-S-N. 

J  i  1  N 

[The  d(ot,n)  values  are  from  the  standard  one-sample  K-S  table]. 


Some  different  procedures  result  from  employing  Helmert  matrices  here. 


Theorem  6.2.  Let  Y  =  (Y^,  . ..,  Y^)  =  ?  .  Then  conditionally,  given 

W  =  w*,  one  has 


(a) 

(b) 

(c) 

(d) 

(e) 


Yj,  ....  Y^  are  independent; 

Y.  'v  V( 0,  1/w*)  for  1  <  i  <  N  -  1; 

Yn  *  Wfav^N,  4rl ; 

N-l 

(Y  ,  R*)  is  the  M-S-S;  where  R*  =  [  £ 

1 

V  =  (V  ,  . .,  V  ,)  is  the  M-S-N,  with 

A,  I  N~  1 


y2T1/2 

r 

z. 

v  =  — i* 

i  R*’ 


and 


(f)  f—  ,  7)  is  the  MLE  of  (a,  w*) . 

yfr  (R*) 

The  new  decision  rules  are  based  on  an  F-ratio  and  the  empirical 

i  N-! 

cdf  FN-1 C ^ '  where  FN  1(Z)  =  NT  ^  e  (z  -  Y.). 

j=l  J 


Decision  Rule  6.4.  Decide  N  +  S0  iff 

K-!  =  SUP  IFn-i(z)  -  >  d(a,N-l) . 

z 

[See  Decision  Rule  5.1,  for  critical  values]. 


Decision  Rule  6.5.  Decide  N  ♦  Sj  iff 
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°~N-1  =  SUp  ^ Fn-1 “  f7  +  ?  tan'A(^H  >  <*(ot,n  -  J)- 


1, 


[See  Decision  Rule  5.2  for  critical  values]. 


Decision  Rule  6.6.  Decide  N  +  iff 


°N-1  *  Sup 
z 


.  N-l 

jjTj-  l  E  (Z  -  xj)  -  *(z)|  >  d(ct,n  - 


1) 


where  X  =  (Xj,  ....  XN1)  is  E-S-N  and  X^  =  ~  and  (R**)Z  = 


N-l 


J  XT.  [See  Decision  Rule  5.4  for  critical  values]. 


Decision  Rule  6.7.  Decide  N  +  iff 


”*  .2,  fV  „2._1  ,N  -  m  -  1 


T*  -  [l  Yf]  [  I  Y‘]  (i 
1  J  m+1  J 


)  >  f»  or  <  f" 


[See  Decision  Rule  5.3  for  critical  values]. 

One  should  note  that  in  the  event  that  cross-sectional  data  is  available, 
some  adjustments  should  be  made  in  the  decision  rules  of  this  section  and 
the  preceding  section.  One  such  adjustment  is  suggested  by  the  following 
development,  based  on  Bell  (1975). 

Let  Z. ,  Z_,  . . . ,  Z„  be  i.i.d.  initial  k-segments  of  time  series 

<\jL  r\Ji 

with  common  law  L  in  Q(K-S-E) .  Then  one  has 

Theorem  6.3.  (1)  P((jZj,  *  ^11  ^ifc’  ^21*  S-E) 

l  N 

*  0;  and  (2)  7.)  is  S-E,  where  Z  ,  *  r?  \  Z  .. 

•1  ON  rs2  “1 

This  result  suggests  that  the  decision  rules  of  Section  5  and  6 
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can  be  applied  to  the  mean  vectors  of  cross-sectional  data. 

Numerical  examples  illustrating  all  of  these  decision  rules  are  given 
in  Appendix  I . 

With  so  many  decision  rules  being  considered,  it  is  natural  to  ask 
which  rules  are  better.  More  specifically,  one  might  ask  how  the 
decision  rules  based  on  the  Z's  compare  with  those  based  on  the 
Y's  of  this  section.  No  definitive  answers  are  kn«wn  at  this  time. 

[See  Section  10] . 

One  now  turns  to  detection  problems  for  which  cross-sectional  data 
is  necessary  and  available. 

7.  Goodness -of-Fit  Detection  Problems 

PN3  L  =  Lq  e  Sl(K-S-E)  vs  N  +  S3:  L  f<  l.Q 
It  is  convenient  to  write  the  data  in  matrix  form 


As  usual  one  views  the  problem  first  from  the  Sl(S-S-E)  viewpoint. 

This  means  that  W^,  ...,  l\'N  are  i.i.d.  H(-)  with  H(0)  =  0; 

and  Z..,  ...,  7...  are  c.i.i  d.  N(0,  [w*]  *),  given  W.  =  w*; 

J  JK  J  J  J 

for  j  =  1,  2,  ....  N. 

2  k  2 

One  now  defines  R7  =  J  Z.  :  V.  =  (V.,,  ...,  V..)  whe^e 

J  m=l  J®  J1  3* 
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vjm  '  ^  ;  ”d  S'2>’l  fE'2-V- 

3  1 

It  can  be  proved  that 

Theorem  7.1.  Condi t i onal ly ,  given  =  wt  for  1  _<  j  £  N, 

(1)  the  M-S-S  is  5(Z)  =  [R(l) ,  ....  R(N)],  the  ordered  radii; 

(2)  the  M-S-N  is  N(Z)  =  [R(R^) ,  ...  RfRN\  Vj,  •  • ,  VN] ; 

and 

(3)  Rj,  ....  RN  are  i.i.d.  GQ(-) ,  where  G0(z)  =P{RJ  <z|L0>, 

i.e.,  G0()  is  the  radial  cdf. 

A  pertinent  decision  rule  is  as  follows. 


Decision  Rule  7.1.  Decide  N  +  iff 


D  =  sup  |c  (z)  -  Gn(z)|  >  d(a,n). 


[See  Decision  Rules  5.4  and  6.6  for  rritical  values]. 

One  aspect  of  the  decision  rule  above  worth  knowing  is  the  exact 
relationship  between  Lq,  the  law  in  fi(S-S-E)  and  Hq(-)>  the 
rixing  measure  and  cdf  of  the  W's;  and  ^qC’»  •••,  *)  the  joint 
density  function  of  Z  =  (Zj,  . . . ,  ZN) .  It  can  he  proved  that 

Theorem  7  2.  (1)  GQ (z)  =  j°°  Nk(wz2)dHQ(z)  where  N^-)  is  the 


cpf  of  a  X^-distribution.  (2)  For  fixed  N,  each  of  Gp('),  Lq, 
HqC-)  and  fQ(-,  ...  0  is  determined  by  any  other.  [See,  e.g., 

Kelker  (1971),  Bell,  et  al  (1970),  Bell  (1975).]. 
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This  means  that  the  detection  problem  PN^:  L  =  LQ  is  equivalent 
to  PN*:  G  =  Gq5  and,  hence,  the  decision  rule  is  "completely  relevant." 
A  related  detection  problem  is  treated  in  the  next  section. 

8.  Goodness-of-Fit  With  a  Nuisance  Parameter. 

PNj  *-({Zr  -  a})  =  Iq  e  ft(K-S-E)  for  some  "a"  (possibly 

non- zero) 
vs 

N  +  S4:  For  each  a  L( {Z^  -  a»  t  LQ. 

%  %  *\j  'Vwjp 

For  the  data  matrix  2  as  in  Section  7,  one  forms  Y  =  Z  HN  * 


For  constructing  the  M-S-S  and  M-S-N  one  needs 


(a)  YJ  =  Y.k  for  1  <  j  <  N; 


2  ^r1  v2 

(b) 

(RJ)  =  l  Yis  ; 

J  S  =  1  J 

(c) 

Y*  =  [Y*(l) ,  .  .,  Y*(N) ] , 

the  ordered 

Y*'s. 

(d) 

R*  =  [R*(l) ,  ....  R* (N)l , 
% 

the  ordered 

R*'s; 

Y. 

Ji 

R*  ' 

1 

(e) 

V.  =  (V.r  ....  vkjk_1), 

where  V.  = 
J5 

One  has  then. 
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Theorem  8.1.  Conditionally,  given  IV..  =  w?  for  *  £  j  £  N, 

(a)  S(Z)  -S(Y).  [Y*.  R*] ;  and 

'V  'V 

(b)  N(Z)  =  N (Y)  =  [R(YJ)  R(Y*) ;  R(R*) ,  ....  R(R*); 

For  the  decision  procedure,  one  needs  the  cdf  of  RJ.  Call  it 
G*(-)  where  G*(z)  =  P{RJ  <  z] . 

Theorem  8  2.  (a)  G*(-)  determines  and  is  determ  ;d  by 

(b)  R*,  ...»  R*  are  i.i.d.  G* 

The  decision  rule  is  then 


Decision  Rule  8.1.  Decide  N  *  iff 


dn-i  =  sup  1  n^T 

Z 


N-l 

l  e  (z  -  R?)  -  G*(z)|  >  d(o,M  -  1) . 


This  corresponds  to  a  goodness-of-fit  test  for  the  modified  radii 

{R* } .  One  could  have  used  any  one  of  a  number  of  other  goodness -of- fit 
J 

statistics,  e.g.,  Cramer-von  Mises.  However,  in  this  work,  one  uses  con¬ 
sistently  the  Kolmogorov-Smimov  statistics. 

The  final  detection  problem  to  be  considered  here  is  a  two-sample 
problem. 


9.  Signal  Detection:  A  Two-sample  Problem. 

PN$ :  Lj  =  L2  e  fi(K-S-E)  vs  N  +  Sg:  L1  ?  L2 
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Hie  data  matrix  here  is 


7  = 


'lk 


mk 


lk 


nl. 


A 

/ 


where  N  =  m  +  n. 

The  relevant  statistics  are 


f  k  , 

\  X.  for  1  £  i  <  m;  and 
s=l  JS 

1  k  2 

I  l  Y.  for  m  +  1  <  ]  £  N. 

1  s=l  38 


One  now  has 

m 

Decision  Rule  9.1.  Decide  N  +  S,.  iff  £  R(R.)  aj  or  £  a2' 

[The  table  here  is  that  for  the  Mann  Whitney-Kilcoxon  Rank-Sum  Statistic]. 

Decision  Rule  9.2.  Decide  N  S^  iff 
m  .  N 

D(m,n)=  sup  |±-  l  e(z  -  R.)  -  -  ]>  £  (z  -  R  )|  >  d"(a,m,n) 

z  m  1  3  n  m+1  3 

[The  table  here  is  that  of  the  two  sample  K-S  statistics]. 
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Again  a  large  number  of  nonparanetric  (NP)  statistics  are  available 
for  this  detection  problem.  The  two  chosen  above  are  representative  and 
have  some  optimal  properties 

The  preceding  developments  lead  to  some  interesting  observations  and 
speculations . 

10 .  Concluding  Remarks  and  Open  Problems. 

(A)  Parametric  and  NP  Statistics 

Signal  detection  problems  related  to  SE  time  series  utilize 
parametric  as  well  as  NP  techniques.  The  snhericity  property  allows  one 
to  employ  the  F-statistic  and  othe1'  classical  statistics;  while  the 
families  of  rndial  distributions  are  NP  ir.  character.  Hence,  one 
could  have  used  a  variety  of  goodness-of-fit  and  NP  statistics.  TTie 
relative  utilitv  of  these  other  statistics  is  at  this  time  an  open 
question.  [See  (B)  below]. 

(B)  FDR  and  Power.  The  procedures  given  in  the  text  are  reasonable 
and  commonly  used.  However  very  little  precise  information  is  available 
on  FDR  and  power.  A  series  of  definitive  studies,  evaluating  FDR  for 
reasonable  (N  +  S)-distributions ,  is  needed.  Some  preliminary  Monte 
Carlo  results  have  been  relatively  costly  and  less  than  definitive.  As 
of  this  writing,  it  is  difficult  to  say  which  of  the  available  procedures 
is  better  than  which  others.  In  particular,  one  asks  how  do  the  pro¬ 
cedures  based  on  the  Z's  compare  with  those  based  on  the  Y's  in 


Section  6. 
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(C)  Polynomial  Drift.  Techniques  were  presented  above  for  time 
series  S-E  about  0  and  about  non- zero  points  "a".  Of  some 
interest  would  be  background  noise  which  is  SE  about  some  polynomial 
curve.  This  would  be  related  to  polynomial  regression  with  SE 
(rather  than  Gaussian)  errors  and  should  be  also  related  to  some 
recent  robustness  studies. 

(D)  Guassian  Markov  Processes ■  It  is  developed  in  Appendix  II, 
that  S-E  time  series  can  be  considered  "imbedded"  in  certain  mixtures 
of  some  Gaussian  Markov  processes.  It  is  an  onen  problem  as  to  how 
the  very  rich  literature  on  Gaussian  Markov  processes  can  be  brought 

to  bear  on  signal  detection  problems. 

(E)  Polar  Coordinates .  Some  explicit  detail  about  the  polar 
coordinate  aspects  of  S-E  times  series  is  given  in  Appendix  II.  The 
current  interest  in  geometric  probability  may  give  some  new  insights 
into  developing  signal  detection  procedures  different  from  the  enes 
presented  in  this  paper. 
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APPENDIX  I 


1 .  Tables 

2 .  Data  Sets 

3.  Numerical  Examples 

(The  tables  were  computed  by  S.  M.  Lee.  He,  A.  Mason,  and  G.  Muse  did 
the  computations  for  the  numerical  examples.) 
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TABLE  A. 1 .  Critical  Points  "or  D  (See  Decision  Rule  5.1). 

n 

A 

[Dn  a  modified  Lilliefor-=-tyne  K-S  statistics]. 


Sample 


Size 

.01 

.05 

.10 

.15 

.20 

.25 

2 

0.8369 

0.8202 

0.7968 

0.7697 

0.7393 

0.7062 

3 

O^-’S 

0  7217 

0.6566 

0.5992 

0.5507 

0  5294 

4 

0.7335 

0.6210 

0.5650 

0.5289 

0.4985 

0.4701 

5 

0.6603 

0.5665 

0.3074 

0.4695 

0.4403 

0.41R2 

6 

0.6071 

0.5140 

0.4639 

0.4289 

0.4002 

0.3790 

7 

0 . 5663 

0.4"66 

0.4288 

0.39  [1 

0.3752 

0.3534 

8 

0.5337 

0.4497 

0.4048 

0  3739 

0.3511 

0.3322 

9 

0.5048 

0.4250 

0.3823 

0  3530 

0.3318 

0.3137 

10 

0.4772 

0.4005 

0.3609 

0.3351 

0.3145 

0.2978 

11 

0.4659 

0.3866 

0.3458 

0.3201 

0.2999 

0.2836 

12 

0.4438 

0.3-09 

0.3318 

0.3069 

0.2886 

0.2725 

13 

0.4278 

0.3577 

0  3198 

0.2964 

0.2774 

0.2622 

14 

0.4095 

0.3411 

0.3067 

0.2838 

0.2656 

0.2513 

15 

0.3973 

0.3310 

0.2974 

O  2753 

0.2589 

0.2450 

16 

0.3872 

0.3215 

0.2875 

0.2674 

0.2513 

0 . 2374 

17 

0.3769 

0.3130 

0.2819 

0  2599 

0.2439 

0.2301 

18 

0.3690 

0.3024 

0.2712 

0.2511 

0.2356 

0.2226 

19 

0.3567 

0.2966 

0.2655 

0.2455 

9.2311 

0.2183 

20 

0  3479 

0.2896 

0.2598 

0.2408 

0.2260 

0.2137 

21 

0 . 3369 

0.2820 

0.2521 

0  2333 

0.2193 

0.2075 

22 

0 . 3344 

0.2781 

0.2473 

0.2284 

0.2141 

0.2023 

23 

0.3189 

0.2704 

0.2415 

0.2233 

0  2097 

0.1986 

24 

0  3172 

0.2627 

0.2361 

0.2183 

0 . 2048 

0.1938 

25 

0.3082 

0  2558 

0.2306 

0.2139 

0.2015 

0.1906 

26 

0 . 3054 

0.2551 

0.2266 

0  2095 

0.1968 

0.1863 

27 

0.3016 

0.2500 

0.2244 

0.2076 

0.1943 

0.1838 

28 

0.2903 

0.2444 

0.2189 

0.2031 

0.1906 

0.1803 

29 

0.2897 

0.2416 

0.2169 

0.2001 

0  1879 

0.1782 

30 

0.2843 

0.2356 

0.2121 

0.1975 

0.  853 

0.1748 

30 


0.6700 
0.5107 
0.4-126 
0.3969 
0.3615 
0.3355 
0.3159 
0.2981 
0.2825 
0.2695 
0.2586 
0.2491 
0.2387 
0.2328 
0.2261 
0  2186 
0.2115 
0.2078 
0 . 2029 
0.1973 
0.1927 
0.1888 
0.1847 
0.1809 
0.1774 
0.1750 
0.1718 
0.1689 
0.1662 


[Table  computed  by  S.  M.  Lee,  20,000  repititions] 


TABLE  A. 2.  Critical  Values  of  D  (See  Decision  Rule  5.2). 
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Data  Sets 

(The  data  sets  herein  were  generated  as  described  in  Section  1,  and  in 
Section  2.C.  The  actual  laws  of  the  underlying  stochastic  processes  are 
in  fl(S-S-E). 
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Data  Set  B. 


(Ten  independent  random 


xlj 

x„. 

23 

X3j 

1.0784 

1.5952 

9.2919 

0.5370 

0.9007 

0.1444 

0.5882 

1.6478 

2.9246 

1.3147 

2.9117 

4.0288 

4.3221 

2.0311 

1.2146 

0.5331 

7.4930 

2.1394 

5.5134 

1.8511 

2.5914 

0.5144 

2.6477 

7.3539 

1.1833 

0.3628 

1.3510 

0.5530 

0.2034 

1.2320 

amples  of  size  6  from  an 


X,  • 

X53 

X6j 

1.7778 

1.1270 

0.2590 

1.5916 

1.6744 

1.5387 

4.0697 

0.3525 

2.1723 

1.0712 

0.2252 

1.6121 

0.1451 

5.2428 

1.8572 

9.8953 

1.4496 

0.4987 

3.7042 

2.4613 

0.8480 

1.5439 

1.2565 

1.5495 

0.2951 

7.6608 

5.6672 

0.7505 

0.7187 

5.2019 

Exp  (0.5)  population) 


X 

®X 

2.5215 

3.3584 

1.0644 

0.6366 

1.9591 

1.4134 

1.8606 

1.3757 

2.4688 

1.9322 

3.6681 

4.0134 

2.8282 

1.6151 

2.4776 

2.4853 

2.7473 

3.1286 

1.4432 

1.8711 

Data  Set  C. 

(Ten  independent  random  samples  of  size  6  from  Exp  (0.5)  population). 
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Numerical  Examples  for  Section  5. 

Pt^:  L  €  fi(K-S-E)  vs.  N  +  Sj :  L  $  fi(K-S-E) 

5.1  Modified  Lilliefors 

5.2  Modified  Srinivasan 
5.3.  F-test 

5.4  E-S-N 


The  examples  here  are  numbered  corresponding  to  the  decision  rules  of 
the  text. 
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Example  5.1. 


Le  fl(K-S-E)  vs.  N  +  Sx s  Le  O(K-S-E) 


1 

2. 38000E-2 

2 

9. 20000E-2 

3 

-1.73000E-2 

4 

-8.65000E-2 

5 

1.07300E-1 

6 

1.73000E-2 

7 

-2.77500E-1 

8 

5.77000E-2 

9 

-1.36800E-1 

10 

2. 03100E-1 

11 

-1.61000E-2 

12 

5.77000E-2 

13 

5.54000E-2 

14 

7.24000E-2 

15 

3.72000E-2 

16 

-1.01300E-1 

17 

-6. 78000E-2 

18 

1.25300E-1 

19 

1.06800E-1 

Decide  N  +  iff 

Bn  ■  sup  |fn(v)  -  *<^p)  |  >  d(o,N) 

From  Table  A.l,  one  finds  d (.05,19)  *  0.2966.  Hie  obtained  statistic 
value  is  D19  **  0.195.  Therefore,  one  decides  PN. 
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Example  5.2. 

PNX:  Le  Q(K-S-E)  vs.  N  +  L  {  Q(K-S-E) 


Data:  1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 


1.76000E2 
1.88000E2 
7.11000E2 
7.32000E2 
6.10000E2 
1. 00800E3 
1.02400E3 
1.19500E3 
7 . 36000E2 
1.19600E3 
1.18100E3 
1.04400E3 
1 . 24500E3 
1.50600E3 
1.57600E3 
1.34600E3 
1.10300E3 
1. 29600E3 
9.81000E2 
1.44000E3 


Decide  N  +  iff 


Dn  «  sup  kN(y)  -  [y  +  7  tan”1  (jj)J  J  >  d(a,N) 

y 


From  Table  A. 2, 
x 

value  is  D2Q  ■ 


one  finds  d(.05,20)  *  0.4251.  The  obtained  statistic 
0.512.  ttiecefore,  decide  N  +  S. 
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Example  5.3. 


PNi:  L  €  vs.  N  ♦  Sj:  L  «f  « CK-S-E^ 


Data: 


Z 

a. 


[6.9707,  6.987? ,  7.0979,  6.996’  7-0322,  6.9174,  6.9S23, 

7.0079,  7.0597,  7.0263] 


F-statistic 


5  2 
h 

*r  —  A 

246.202 

1.007 

10  7 

U] 

6  1 

244.496  * 

Decide  N  +  iff 

T  >  F 

(5, 5,. 975) 

=  7.1464 

or 

T  <  F 

(5,5, .025) 

=  .1399 

Tli  ere  fore, 
decide  PN. 

Example  5.4. 


PNj:  L  e  fi(K-S-E)  vs.  N  +  S^:  L  £  fl(K-S-E) 

Data: 

Z  -  [6.971,  6.988,  7.098,  6.996,  7.032,  6.917,  6.952, 
7.008,  7.060,  7.026] 


-  45 


Generate  Y  =  (Y, ,  . ..,  Y,rt)  i.i.d.  $  independent  of  Z. 

v  i  w  *v» 

Y  *  [-0.511,  '0.884,  0.307,  -1.169,  1.146,  -1.107,  0.140, 
-0.168,  1.698,  0.143] 

10  -  1/2 

R,  =  C  [  ZT)  =  22.152 

t  1  3 


*  [0.890,  0.894,  0.909,  0.895,  0.900,  0.885,  0.890, 
0.897,  0.904,  0.899] 

Test  statistic: 

i  10 

°N  =  sup  llo  £  e  (z  *  V  " 

z  X 

*  nax  (nax[$(z)  -  — j_A,  yg-  * 
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k 

v  00 

*(V(k)) 

_ 

*(V(k))- 

k  -  1 

10 

k 

10 

*(V(k)) 

■n 

. 

■US 

- 

0.890 

0.894 

.8143 

.5143 

0.895 

.8146 

.4146 

- 

0.897 

.8151 

.3151 

- 

0.899 

.8156 

.2156 

- 

8 

0.900 

.8159 

.1159 

- 

9 

0.904 

.8170 

.0170 

.0830 

10 

0.909 

.8183 

— 

.1817 

DJq  =  .8120  >  .410  (a  =  .05) 
Decide  N  +  S 


Numerical  Examples  for  Section  6. 

PN,:  L  ({Z  -  a})  e  fl(K-S-E)  vs.  N  +  S,:  L({Z  -  a»  t  fl(K-S-E) 

Z  %  Z  f\f 


6.1 

Lilliefors 

6.2 

Srinivasan 

6.3 

E-S-N 

6.4 

Helmert  modified 

Lilliefors 

6.5 

Helmert  ■)  modified 

Srinivasan 

6.6 

Helmert  *■>  E-S-N 

6.7 

Helmert  F-Test 
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PN  •  L({Z  -  a})£  ft(K-S-E)  for  some  a 
N  +  S_:  L({z  -  a})  $  fl(K-S-E)  for  any  a 

2  Or 


Data  Set  D  was  used: 

1st  case:  historical  data  ("first  look") 
2nd  case:  cross-sectional  data  (means) 


2.  Determine  order  statistics  X(l),  X(2),  ....  X(n). 

3.  Compute  Lilliefors  statistic 

Dn  -  sup  |PN(«)  - 

Z  A 

*  max  (max  [*(^g — -)  ■  ”  »  ft  “  *(* 

4.  Decide  N  +  S  iff  D^  >  d(a,N)  where  d(a,N)  is  obtained  from 


Lilliefors'  table. 


<<Q 
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Case  1 :  Lilliefors  method  (historical  data) 
Data  Set  D  (1st  "look") 


Xx  =  6.3290 


S 2  =  1.7008 
*1 


SY  =  1.3041 
*1 


w- r 1.  i 

k  Z(k)  "  1  SXJ  *(w)  *(w)-  ^ 


■■ 

3.4160 

1  -2.2337 

.0127 

.0127 

6.8105 

.3692 

.6440 

.4773 

6.8522 

.4012 

.6600 

.3267 

B 

6.9583 

.4826 

.6855 

.1855 

6.9661 

.4885 

.6870 

.0203 

B 

6.9707 

.4921 

.6895 

=  .4773  >  .319  (a  =  .05)  Decide  N  ♦  S. 


Case  2:  Lilliefors  method  (cross-sectional  data) 
Data  Set  D 


*  6.9006 


s|  «  .2349 


«  .4846 


Dn  *  sup  |Fn(2) 
z 


=  max  {max  [$(Z  -~—  ) 
k  bX 


k  -  1  k 
10  '  10 


-  $(w) 


.1540 


.1463 

.3105 
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t 


k 

Z(k) 

A 

4>(w) 

♦W-  ^ 

To  * 

6.1475 

-1.5664 

mm 

n 

mam 

6.3290 

-1.1919 

KSSm 

6.4858 

-  .8684 

.3070 

.1070 

- 

6.7568 

-  .3088 

.3790 

.0790 

.0210 

6.7813 

-  .2586 

.3980 

- 

.1020 

6.8378 

-  .1415 

.4435 

- 

.1565 

7.2096 

-  .6252 

.7340 

- 

8 

7.3357 

.8120 

- 

- 

9 

7.5368 

1.3005 

.9035 

- 

10 

7.6456 

1.5249 

.9365 

.0365 

.0635 

D10  =  .1565  <  .258  (a  =  .05)  Decide  PN 


Example  6.2. 

Srinivasan  (1970) 

Hie  Srinivasan  statistic  is 

.  'v  - 

Djj  =  sup  |Fn(z)  -  Fq(z)  | 
z 

where 

F„(z)  is  the  empirical  distribution  function 
N 

Fq(z)  *  E[Fn(z)|M-S-S];  and  the 
M-S-S  is  S(X)  *  (X,  Sy).  Hence 

*\»  A 

'V  ,  fU  _  7 

FqCujw.o^)  «  ffxJX,  Sj)dx, 

'  —  oo 
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where 

f (Xj I  X, 
^0(u;y,o2) 


1/2 


kn  -  1J 


rc|[n  -  1]) 
TtrCyCn  -  2]) 


'V(u) 

g(y)dy ; 
o 


n-4/2 


g(y) 


TCn-2) 


{r(|[n-2])}: 


n/2-2 


(i  -  y) 


n/2-2. 


and 


V(u)  =  i  (1  +  ( 


n  -  1 


1/2 

0  )■ 


Computation  of  Srinivasan  statistic  for  n  *  10 

Given  a  set  of  historical  data:  Xj ,  Xj,  ...»  X1Q  one  wishes  to  test 

PN7 :  L({X  -  a})  e  Jl(K-S-E) 

^  'v, 

(i.e.,  S-E  about  some  unknown  a). 

1.  Compute  order  statistics  X(l),  X(2),  ...»  X(10) 

2.  Compute  X  and  S^. 

1/2  — 

3.  Evaluate  V(u)  =  j  {1  +  (^j-)  (^g — — )) 

with  u  =  X(j)  and  n  =  10  for  j  =  1,  ....  10, 

i.e.,  V[X(j)]  =  \  (1  ♦  ^  [X  ~s*W-]}  for  j  a  1 ,  ....  10 


4.  Evaluate 


,Oc><? 
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Fo[X(j);y,02]  =  1  -  (V[X(j)]}4  [35  -  84  V[X(j)] 

♦  70  {V[X(j)]>2  -  20{V[X(j)]}3] 

=  max  |-j^-  -  F_[X(j)  ;u,o2]  | 
l<j<10 


_  2  him. 

X  X(j)  X  -X  (X  -X)^  S  V[X(j)] 

_ , _ ) _ ,  A _ _ _ _ 


-.4979 

-.4979 

-.2504 

.0627 

1.2743 

1.1716 

-.4105 

-.4788 

-.1630 

1.1771 

1.1204 

-.0182 

-.4105 

.2293 

.0526 

.8295 

.9372 

.0388 

-.3687 

.2863 

.0820 

.6168 

.8251 

-.2179 

-.3322 

.0296 

.0009 

.4310 

.7272 

-.2353 

-.2353 

.0122 

.0001 

-.0621 

.4673 

.0454 

-.2179 

.2929 

mmM 

-.1506 

.4206 

-.4788 

-.0182 

-.2313 

mmm 

-1.1669 

-.1150 

-.3322 

.0388 

-.0847 

-1.4570 

-.2679 

-.3687 

.0454 

-.1212 

.0147 

-1.4906 

- . 2856 

o<?<? 
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[V(X(j)]]‘ 


[vcx(j)]]3  [V(x(j)]]4 


P0[X(j)] 


1.3726 

1.6082 

1.8842 

.0511 

1.2553 

1.4064 

1.5758 

.0082 

.8783 

.8232 

.7715 

.0031 

.6808 

.5617 

.4635 

.0203 

.5288 

.3846 

.2797 

.0940 

.2184 

.1020 

.0477 

.5709 

.1769 

.0744 

.0313 

.6693 

.0132 

-.0015 

.0002 

.9909 

.0718 

-.0192 

.0052 

.6728 

.0816 

-.0233 

.0067 

.5634 

Decide  N  +  S  iff 

%  Jfc 

1Q  >  d( .05 , 10) =0 . 24  .  D,q=  .4366-  Decide  N  ♦  S. 


E~S~N  [Durbin  (1961)].  Tlie  procedure  is  as  follows. 


-  F0[X(j)] 


.0489 

.1918 

.2969 

.3797 

.4060 

.0291 

.0307 

.2272 

.4366 


1 .  Compute  X  and  . 

2.  Generate,  by  standard  methods, 

E-S-N:  Yj,  Y2,  ...,  Yn  idd  4>  independent  of  X 

—  ^Y  — 

3.  let  Y’  =  Y  +  (X.  -  X) 

J  5X  3 

4.  If  Xj,  X2,  ....  XN  e  fl(K-S-E)  then  (YJ,  Y’,  ...,  Y^)  =  (Yj ,  Y. 

i.e.,  Y’.Y’  ,  ....  Y-  are  i.i.d.  *. 


V 


5.  Determine  order  statistics  Y'(1),Y'(2),  ...,  Y'(N) 
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6.  Use  Kolmogorov-Smirnov  test: 

D»  =  sup  |F  (2)  -  F0(z)| 
z 

=  max  [max  (4>[Y'(k)]  -  —  ,  -  -  $[Y'(k)]}] 
k  n  n 

7.  Decide  N  +  S  iff  >  d(a,N)  where  d(cx,N)  is  value  from 
Kolmogorov-Smirnov  tables. 

On  applying  the  above  procedure  to  Data  Set  D,  one  finds: 

-  6.9066;  S^-  =  .2349;  =  .4846; 

M-S-N:  N(X)  =  [-1.5664,  -1.1919,  -.8684,  -.3088,  -.2586,  -.1415 

'V 

.6252,  .8854,  1.3005,  1.5249]; 

E-S-N:  Y  =  [.807,  -1.306,  .875,  -.081,  -1.056,  1.241,  1.126,  -.397, 

% 

.244,  1.207];  and 

Y  =  .266  =  .8019  Sy  =  .8955 

Y |  =  .266  +  (.8955) (-1.5664)  =  -1.1367 

Y^  =  -.8013  Y^  =  -.5117  YJ  =  -.0105 

Y£  =  .0344  Y£  =  .1393  Y}  =  .8259 

Y£  =  1.0589  Yg  =  1.4306  Y|Q  =  1.6315  . 
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rhe  computations  for  the  K-S  test  are  as  given  below: 


k  Y'OO  *[Y'(k)]  .  *[V(k)]-^i  Yo-*[Y'(k)] 


1 

-1.1367 

.1280 

.1280 

. 

2 

-  .8013 

.2080 

.1080 

3 

-  .5117 

.3040 

.1040 

- 

4 

-  .0105 

.4940 

.1940 

- 

5 

.0344 

.5135 

.1135 

- 

6 

.1393 

.5550 

.0550 

.0450 

7 

.8259 

.7955 

.  19S5 

- 

8 

1.0589 

.8550 

.1550 

-  . 

9 

1.4306 

.9238 

.1238 

- 

10 

1.6315 

.9487 

.0487 

.0513 

D"0  =  .1955  <  .410  (a  =  .05)  Decide  PN. 

Examples  6.4  -  6.^7 

PN„ :  L({z  -  a})  e  fi(K-S-E)  for  some  a. 

2  n, 

vs . 

N  +  S_:  L({ Z  -  a})$  Jl(K-S-E)  for  any  a. 
i  -v 

For  these  four  examples  one  multiplies  the  original  data  vector  by  the 
Helmert  matrix  of  appropriate  dimension  and  reverts  to  the  methods  of 
Examples  5.1  -  5.4. 


Original  data:  Z  The  Data  Vector,  after  being  multiplied 

by  the  Helmert  Matrix  size  19  is: 


1 

5 . 33300E-1 

2 

-7 . 51000E-2 

3 

6.25200E-1 

4. 30204E-1 

4 

9. 73000E-2 

-3.23414E-1 

5 

-4.60300E-1 

2.28486E-1 

6 

-1.43000E-2 

6.75717E-1 

7 

-2.29600E-1 

1.44580E-1 

8 

-4.33000E-2 

3.21522E-1 

9 

-1 . 74400E-1 

1.04178E-1 

10 

-3.57700E-1 

2. 15479E-1 

11 

1 . 70900E-1 

3.66623E-1 

12 

-4.59800E-1 

-1.72377E-1 

13 

6.S4100E-1 

4.46492E-1 

14 

5.53200E-1 

-6.59488E-1 

15 

1 . 18600E-1 

-5 . 13337E-1 

16 

-3.62000E-1 

-5.80276E-2 

17 

4.65700E-1 

4. 11059E-1 

18 

-4.56600E-1 

-4.16864E-1 

19 

5.39000E-1 

5.03291E-1 

-4.92980E-1 

2.57909E-1 

Example  6.4. 

Decide  N  +  S  if 


Dn1  =  sup  |F*_1(x)  -  M-gP)|  >  3(a,N  -  1) 
z 

From  Table  A.l,  one  finds  d(.05,  18)  =  0.3024.  The  calculated  value  jc 
the  statistic  is 


°18  -  °’214 


Therefore ,  decide  PN 


<?•< 
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Example  6.5. 

Decide  N  +  S  iff 

Dn_x  =  sup  IF'.jCz)  ♦  [\  -  J  tan'1^)]!  >  d(a,N  -  1) 


From  Table  A.  2,  one  finds  d(.05,  18)  =  0.4233.  Hie  calculated  value  of 


the  statistic  is  D, _  =  0.373.  Therefore,  decide  PN. 

1  O 


Example  6.6. 

1.  Let  Y  =  [Y^,  •  ••»  =  Z  H^g  where  Z  =  fZj ,  . ..,  Zjg] 

the  original  data. 

2.  Generate  X  =  [X, ,  . . . ,  X  „]  i.i.d.  ♦  independent  of  the  original 

*\j  1  1 0 

data. 


18 

Compute  R*  =  (]>  Y?)1/2 

1  J 

and 

R**  = 

18  9 

(I 

1  3 

Generate  X'=[X',  ....  Xl_] 

*\j  1  lo 

where 

Yl  B 

R**  v 

Xj 

R*  : 

Decide  N  +  S  iff 

Dis  =  SUP  1  Tg  I  (2  - 

z  X 

Xj)  - 

*(0| 

>  d(.05 

calculations  yield: 

■N:  X  =  [.089,  .233,  .912, 

— “  rt . 

-1.10, 

,  -.515 

,  1.58, 

-.149,  1.639,  -.798,  -1.888,  .344,  .647,  .718,  -.125] 
R**  =  ( l  XT)1' 1  =  4.467 


18  n  1/9 

R*  =  (I  'i.)  '  «  1.793 

1  J 


1 

* 

1 

1 
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R-S-N:  X' 

=  [1.071, 

-.805,  .568,  1 

.684,  .361,  .802,  .259,  .536,  .914, 

'X, 

-.428, 

1.111,  -1.642, 

-1.278,  -.144,  1.024,  -1.039, 

i 

1.253, 

1.228] 

k  X’ (k) 

*[X’(k)] 

k-1 

k/18  18 

-  *[X* (k)]  »[X'Ck)]  -  ~ 
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Example  6.7. 


Decide  N  +  S  iff 


l  * 

Tl  _  1 _ - -  (—  '  m-. >  p 

N-l  ~  1  m  J  (N-m-l,m,l 

i  i 

m+l  J 


or  <  F 


(N-m-1 ,m,a/2) 


Critical  values  are:  (N  =  19,  m  »  9,  a  =  .05) 


F(9,9, .025)  =  0-25 


F(9,9,.975)  =  4‘03 


The  obtained  statistic  value  is 


T'  ■  CTT  ■  °'45 


Therefore,  decide  PN. 


Numerical  Examples  for  Section  7. 

PN3:  L  =  LQ  e  n(K-S-E)  vs.  N  +  S^:  L  t  LQ 

Example  7,1. 

Data:  Data  Set  A  (cross-sectional) 


-a/2) 


59  - 


1. 


Compute 


S2 

J 


10  ? 

T  ZT  for  1  <  j  <  6 
r=l  - 


s2  *  1  6186 

S2  =  3.0323 

4 


S2  =  2.8889 
S2  =  1.7226 


S2  =  1.4138 
S2  =  3.1484 


2.  Decide  N  ♦  Sj  iff 

=  sup  |G(  (z)  -  G0(z)|  >  d( .05 ,  6) 

C2  5 

F  ,  (wz2)dJ  =  • 

X£  0  l*S^(k) 

2 

Here  w  =  1/a  and  JQ  *  Exp(5). 


Kolmogorov-Smimov  test  on  S. 


S2(k) 


[- 


s2C illJ 


[jq]Q_]5_kii  | 


1 

1.4138 

.0689 

.0689 

.0978 

1.6186 

.0902 

- 

.2431 

1 . 7226 

.1014 

- 

.3986 

2.8889 

.2262 

- 

.4405 

3.0323 

.2405 

- 

.5928 

H 

3.1484 

.2518 

' 

.7482 

D6  ■  '7482  7  8 (6, .95)  ■  -521 


Decide  N  *  S 


-  60 


Numerical  Examples  for  Section  8. 


PN^:  L((Zr-a}}  =  Lq  e  ft(K-S-E)  for  some  a  vs. 


N  +  S4:  L{Zr-a}  ^  LQ  for  any  a 

Example  8.1. 

Data:  First  five  columns  of  data  Set  A 


Zll»  Z1S 


Z61'  Z6S 


1.  Form  new  matrix 


i\,  a.  'r 

Y  =  Z  H5  = 


Yll’  Yl5 


Y61'  Y65 


1.037848 

-  .684439 

-1.323319 

-1.039538 

-  .684434 

-  .119990 

1.347044 

.897589 

-  .352879 

2.121287 

.831179 

-1.372096 

.839572 

-  .128918 

-  .698959 

1.234207 

-  .559795 

.723030 

1.825137 

4,079463 

-1.060587 

1.932456 

-  .786232 

-  .464495 

2.376244 

-2.021135 

-1.815677 

-  .743263 

-  .045593 

.638156 

Compute 

S2 

1 

4  ^ 

=  l  Y2 

for  1  £  j 

<  6 

=  4.377377 


•  2.759115 


-  3.295005 


S‘  =  5.690535 

4 


S*  -  5.693147 


S‘  ■  7.936188 
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Decide  N  *  S.  iff 
4 


D6  =  sup  |G6(2)  -  Gq (z) j  >  d( .05,  6) 
2 

2  5 

G  (2)  =  prsi<z|  i  =  [_ aiilLj 
U  1  +  S  ik) 


[The  mixing  measure  here  is  Exp  (5)  .  ] 


Kolmogorov-Smirnov  Test  on 


S2. 

_1 


k 

s2(k) 

in 

r- 

CM 

C/D 

k. 

r  S2(k)  5  k-1 

k  f  S2(k) 

i  2  1 

l+S^fk) 

2  *  6 

l+SZ(k)  6 

6  1+S2(k) 

mm 

2.759115 

.213020 

.213020 

3.295005 

.265743 

.099076 

.067590 

4.377377 

.357452 

.024119 

.142548 

5.690535 

.445104 

- 

.221563 

5.693147 

.445256 

- 

. 388077 

HI 

7.936188 

.552457 

' 

.447543 

5 

] 


=  .448  <  d(.05,  6)  *  .S21 


Decide  PN 


Numerical  Example  for  Section  9. 

PN5:  L1  “  L2  e  n(K's_E5  vs*  N  +  S5:  Li  i  l2 

9.1  Mann - Wh i tney - Wi 1 coxon  Rank  Sum  Test 

9.2  K-S  two-sample  test 

Data  sets  B,  C  (cross-sectional  data) 
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9.1.  Mann-Whitney-Wilcoxon  Rank  Sum  Test 
Given  two  sets  of  cross-sectional  data: 


Compute  radii  for  each  "look": 


where  m  =  n  =  6 

and  N  = 

12. 

Rj  =  7.4067 

(2) 

R7 

= 

11.3978 

(8) 

R2  =  9.2428 

(5) 

R8 

11.9761 

(4) 

R3  =  13.4661 

(12) 

R9 

m 

11.1978 

(7) 

R.  =  11.7515 
4 

(10) 

R10 

= 

7.4447 

(3) 

R$  =  10.0338 

(6) 

R11 

s 

6.3221 

(1) 

R6  =  8.7033 

(4) 

R12 

= 

11.6063 

(9) 

The  Mann-Whitney  U  statistic  is  computed  as  follows:  (m  *  n  *  6) 

Uj  =  mn  +  S£|*U  .  |  r(r.) 

-  (6)  (6)  +  -  l  (2  +  5  +  12  +  10  +  6  +  4) 

1  1 

*  36  ♦  21  -  39  *  18 

[Note:  This  statistic  is  equivalent  to  the  statistic  in  Decision 
Rule  9.1], 
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„  ,  m  .  -  J  R(«, 

1  1  m+1  5 

=  57  -  (8  +  11  +  7  +  3  +  1  +  9)  =  57  -  39  =  18 

Decide  N  +  S  iff  min{U, ,Uj  <  U,  ,  =  5 

X  fc  o,o, .025 

since  min  ,  U2  >  =  18 
decide  PN 

9.2.  Kolmogorov-Smirnov  2-sample  test 

Here  m  =  n  =  60,  and  the  decision  rule  is: 


Decide  N  ♦  S  iff  D 


(in.n)  >  ^.n.-OS  *  *24®' 


D(60,60)  =  *1667- 


» 


Decide  PN. 
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(A)  Gaussian  Markov  Processes  and  SE  Time  Series 

Consider  the  following  two  Gaussian  Markov  Processes. 

(A)  WL  (Wiener- Levy)  (W(t):  t  >_  0),  where  W(t)  =  y(t)  +  V  1//2W*(t) 
with 

(i)  (W*(t):  t  0}  being  a  Gaussian  Process,  satisfying 

(ii  W*(0)  =  0;  and 
(iii)  E[W*(t)]  =  0; 

(iv)  Cov [W*(s) ,  W*(t)]  =  min(s,t). 

(B)  O-U  (Qmstein-llhlenbeck)  (U(t):  t  >_0},  where 
V(t)  =  y(t)  +  V"1/2U*(t),  with 

(i)  (U*(t):  t  ^  0}  being  a  Gaussian  Process,  satisfying 

(ii)  U* (0)  =  0; 

(iii)  E [U* (t) ]  =  0;  and  Cov[U*(s),  U*(t)]  =  exp  {-y| s-t | } 
for  some  y  >  0. 

When  V  is  a  positive  constant,  the  processes  (A)  and  (B)  above 
are  the  Wiener-Levy  and  Ornstem-Uhlenbeck  processes,  respectively. 

Now  let  V  be  a  positive  random  variable,  i.e.,  one  mixes  the 
processes  and  form  the  random  variables 


a. 

W 

n 

=  W(nA) ; 

b. 

Y 

=  W  -  W 

n 

n  n 

c. 

V 

n 

II 

< 

o 

w 

d. 

Xn 

=  V  -  BV 
n  i 

where  A  >  0. 
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One  can  easily  verify  for  the  (mixed)  W-L  processes  proposition 


(1) 

The 

time 

series 

(Y  : 

n 

n  _>  1}  is 

S-E 

iff 

U(t) 

=  0. 

(2) 

The 

time 

series 

u,  y2,  y3,  . 

is 

S-E 

iff  y (t)  =  y. 

(3) 

The 

time 

series 

{Y  - 
n 

BA:  n  _>  l) 

is 

S-E 

iff 

y(t)  =  Bt  +  a. 

for 

some 

a. 

For  the  (mixed)  0-U  processes  it  is  valid  that 
Theorem  II. 1 

(1)  {X:  n  _>  1}  is  S-E  iff  B  =  exp{-yA) 

(2)  {V^:  n  _>  1}  is  approximately  S-E  iff  yA  is  sufficiently 

large.  [When  yA  >  10,  one  has  approximately  white  noise.] 

(B)  Polar  Coordinates,  Direction  Angles  and  Radial  Distributions. 

For  an  S-E  time  series,  it  is  sometimes  useful  to  formulate 
problems  and  solutions  in  polar  coordinates .  For  an  initial  segment 
of  length  q,  one  has 

Xj  =  R  sin  Oj 

X2  =  R  cos  sin  02 

X  ,  =  R  cos  0,  cos  0„  ...  cos  0  -  cos  0  , 

q-1  1  2  q-2  q-1 

X  =  R  cos  0,  cos  0.  ...  cos  0  -  cos  0  , 
q  12  q-2  q-1 

where  J  0 .  |  <  \  for  j  =  1,  2,  . ..,  q  -  2;  and  |0  |  <  it;  and 

3  *  q-l 

r2=  ?x2. 

1  J 

As  previously  implied  (Section  2.C),  R  and  0  =  (0  ,  0  .  0  ) 

are  independent.  Further,  one  can  prove  (e.g..  Smith  (1969)) 
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Theorem  1 1. 2.  The  direction  angles  6  =  (9 . 0  , )  have 

-  &  ^  1*  q-1' 

joint  density  f(0)  =  4  r(§)  ir  ^2  [cos  0,]q"2  [cos  0  ...  [cos  0  ] . 

%  Z  Z  1  Z  Q-Z 

The  radial  distributions,  joint  densities  and  characteristic  functions 
(See  Table  A.II.l)  are  related  by 


Theorem  II. 3.  For  an  initial  q-segment,  X  =  (X, ,  . . . ,  X  )  of 
-  'v  l  q 

an  S-E  time  series,  (i)  the  characteristic  function  is  of  the  form 

'v  9  2 

=  <Kl  t.)  (ii)  the  density  (if  it  exists)  has  the  form 
1  J 

^  9  2 

f(x)  =  cq  <J) (2,  Xj),  and  (iii)  R  has  (radial)  density  2(r)  = 

2Cq  7rq/2  [r(|)]_1  rq_1  <t(r2). 


% 

Several  of  the  more  important  densities,  f(-);  characteristic 
% 

functions,  <)>(•)  J  and  radial  distributions  g(-)  are  given  in  Table  II.l 
below. 

These  are  given  for  initial  segments  of  length  "q",  in  general, 
and  for  q  =  1,  2,  and  3.  Further  a  scale  parameter  "a"  is 
included  in  all  rows  except  the  first  row.  The  characteristic  function 


is  intractable  in  the  uniform  cases  and  has  been  omitted. 


